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Abstract 

An unknown positive number of items arrive at independent uni- 
formly distributed times in the interval [0, 1] to a selector, whose task 
is to pick online the last one. We show that under the assump- 
tion of an adversary determining the number of items, there exists 
a game-theoretical equilibrium, in other words the selector and the 
adversary both possess optimal strategies. The probability of success 
of the selector with the optimal strategy is estimated numerically to 
0.352917000207196. 



1 Introduction 

An unknown number n > 1 of items are presented at independent uniformly 
distributed times in the interval [0, 1] to a selector, whose task is to choose 
online the last one. This is related to so-called parking and house- selling 
problems in the theory of optimal stopping, but has a more adversarial flavor. 
The problem was originally motivated by a generalization of the secretary 
problem to partially ordered sets (which we discuss in Section |5]), but in view 
of the simplicity seemed worthy of study in its own right. 

We begin with some observations showing that the probability of success 
with a good strategy is a nontrivial constant. In Section [3] we show that both 
the selector and the adversary possess optimal strategies. In Section H] we 
show how to compute numerically the probability of success for the selector 
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at game-theoretical equilibrium. Finally in Section O we explain the origin 
of the problem and its relation to the partially ordered secretary problem. 

Acknowledgments. This paper has grown out of discussions in coffee 
breaks at conferences in the last ten years. I wish to thank in particular 
David Aldous, Thomas Bruss, Ragnar Freij, Svante Janson, Joel Spencer 
and Peter Winkler. 

2 Basic observations 

Starting from the selector's point of view, we wish to maximize the probabil- 
ity of picking the last item under a worst-case scenario. We first give a simple 
proof that there is a strategy for the selector that achieves a probability of 
success bounded away from zero. 

Proposition 2.1. There is a strategy by which the selector succeeds with 
probability at least some constant c > for every n. 

Proof. The selector starts by observing the number k of items arriving in 
[0, 1/2]. The idea is to estimate the total number of items to roughly 2k, 
in which case the remaining arrival times can be approximated by a Poisson 
point process of rate 2k. In particular we expect that with probability roughly 
1/e, the interval [1 — l/{2k), 1] will contain exactly one item. Our strategy 
is therefore to wait until time 1 — 1/(2A;) and accept the next one. 

Since we do not have an a priori probability distribution on n, we cannot 
talk about the probability distribution of n conditioning on k. Instead we 
must fix the selector's strategy and analyze it for arbitrary n. With the 
trivial modification of accepting the first item if = 0, the strategy will 
clearly achieve positive probability of success for each n, and therefore it 
suffices to analyze it for large n. 

For e > we have, by the law of large numbers. 



with high probability as n — t- oo. Conditioning on k and supposing that ([T]) 
holds, the probability that exactly one of the remaining n — k items arrives 
after time 1 — l/{2k) is 
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Therefore as n — )■ oo, the probabihty of success will converge to 1/e, 
which establishes the claim. □ 



Thomas Bruss has pointed out that the odds-algorithm [H [2] provides 
a quite good heuristic for the selector. Although the criteria of the main 
theorem of are not met, it suggests a strategy which accepts the k:th 
item if it arrives after time 1 — l/{k + 1). This strategy gives a success 
probability of at least 5/16, the minimum occurring for n = 3, but we have 
no computation-free proof of this. 

The proof we have given here of Proposition 12.11 essentially reduces the 
problem to choosing online the last event of a Poisson process of known rate. 
The idea of treating the arrival times as a Poisson point process also forms 
the basis for an informal argument that shows (in a way which can be made 
rigorous) that the selector cannot achieve uniformly in n a better success 
probability than 1/e. 

We think of the number n as chosen by an adversary, the devil. The devil 
can choose n from any probability distribution, and in particular may choose 
n according to Poisson(A)-distribution. This means that the arrival times 
will be the times of the events in a rate A Poisson point process on [0, 1]. 
There is a small technicality to resolve: The devil is strictly not allowed to 
choose n = 0, but we can modify the rules of the game so that the selector 
wins by default if n = 0. 

The selector wins by default with probability e^'^, and must otherwise 
choose online the last event of the Poisson process whose rate A we can 
even assume to be known to the selector. We will present a more general 
argument in detail later, but for the moment let us assume that the only 
reasonable thing the selector can do is to choose a point in time and decide 
to accept the next item. Then the selector wins if the number of remaining 
items is exactly 1. If the expected number of remaining items is x, then the 
probability of exactly one is xe~^, which attains a maximum of 1/e for x = 1. 
If our assumption about the strategy of the selector is correct, the devil can 
therefore keep the selector's winning probability below any number which is 
greater than 1/e. 

At this point we present an erroneous argument which pretends to demon- 
strate that, as for the classical secretary problem as well as the setting of [1], 
a success probability of 1/e is achievable. This caused me some confusion, 
especially at the point when another argument seemed to establish the op- 
posite. The way it is presented here the argument lacks rigor in several 
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respects, but as we will later see, the frivolous assumption of the existence of 
a game-theoretical equilibrium is not the most serious issue. Although incor- 
rect, the argument may be of some interest as it reveals a certain paradoxical 
nature of the problem. I apologize to readers who are unable to follow it, 
but explaining it in greater detail is actually impossible. 

Erroneous Conclusion 2.2. There is a strategy for the selector that suc- 
ceeds with probability at least 1/e regardless of n. 

"Proof". First observe that an optimal strategy for the devil cannot be to 
choose n from a distribution with finite support. If it were, then the selector's 
optimal strategy would always accept the A^:th item, where N is the largest 
value of n chosen with positive probability. But then the devil could switch 
strategy and instead choose n = N + 1 and always win. 

Hence the devil will choose arbitrarily large values of n with positive prob- 
ability. But if n is large, the strategy described in the proof of Proposition [2]T] 
will achieve winning probability tending to 1/e. Since at game-theoretical 
equilibrium, all values of n in the devil's strategy must be equally hkely to 
win, the selector's winning probability must be at least 1/e. □ 

After this introduction, where our arguments have deteriorated from sub- 
optimal to non-rigorous to incorrect, let us start over again and be more 
careful. 

3 Existence of optimal strategies 
3.1 A card game 

We wish to apply the von Neumann-Morgenstern equilibrium theorem, and 
therefore start by studying a similar game where each player has only finitely 
many pure strategies. 

In this game there is a deck of d cards, and at the start of the game the 
devil labels the face side of some of them (labels representing items). There 
must be at least one labeled card, and at most A^. The deck is then shuffled 
and the cards are turned up one by one. The selector, knowing d and A^, 
must select online the last labeled card. 

The card game has only finitely many pure strategies for each of the play- 
ers, and therefore by the von Neumann-Morgenstern theorem has a strategic 
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equilibrium. Next we establish some properties of the equilibrium strategies. 
We assume throughout that d > N. Notice that the devil's strategy is simply 
a probability distribution on {1, . . . , N}. 

Lemma 3.1. At any equilibrium in the card game, the devil's strategy gives 
positive probability to each of the numbers 1, . . . ,N. 

Proof. First notice that at equilibrium, the selector must win with positive 
probability, since there is a trivial strategy that wins with probability 
Just make a guess on the number of labeled cards from uniform distribution 
on {l,...,iV}. 

It follows that at equilibrium, the selector's strategy must have the prop- 
erty that for every k < N, it may happen that the h.th labeled card is 
accepted. By this we mean that there is some sequence of cards for which 
with positive probability the selector's strategy accepts the A::th labeled card, 
but no earlier labeled card. If the selector's strategy didn't have this prop- 
erty, then the devil could switch to a strategy of always choosing exactly k 
labeled cards, which would then win with probability 1. 

Now suppose for a contradiction that at an equilibrium, the devil's strat- 
egy gives probability to some number k < N. We want to show that the 
selector can change strategy and strictly increase his winning chances. There 
are two cases: If there is some number greater than k which is chosen by the 
devil with positive probability, then let I be the smallest such number. The 
change of strategy will consist in never accepting the k:th labeled card, and 
instead (whenever the original strategy dictated accepting the k:th labeled 
card) waiting for the /:th labeled card and accept that. 

If on the other hand there is no number greater than k which is chosen 
by the devil with positive probability, then let m < k be the largest number 
which is. The change of strategy will now consist in always accepting the 
m:th labeled card. Since the original strategy sometimes waits for the k:th 
labeled card, this will give a strictly increased winning probability. 

Contrary to assumption we are not at equilibrium, and this contradiction 
establishes the claim. □ 

Corollary 3.2. At any equilibrium in the card game, the selector's strategy 
has the property that it wins with the same probability for all n = 1, . . . , A^. 

We can assume without changing the players' winning chances that the 
selector, when deciding on the action on a labeled card, can take into account 
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only the number of cards drawn earlier and the number of labeled cards 
among them, but not the actual positions (times) of the earlier labeled cards. 
The selector's strategy can then be described by functions fi, . . . , f^, where 
fk : {k, . . . , m} — )■ [0, 1], describes the probability fk{i) of accepting the k:th. 
labeled card if it arrives as the r.th card turned up. 

The next lemma shows that at equilibrium, each will have a sharp 
transition from to 1 going via at most one value strictly between and 1. 

Lemma 3.3. For 1 < k < N and k < i < m, if fk{i) > 0, then fk{i + l) = 1. 

Proof. If k = N then fk must be identically 1 since by Corollary 13.21 the 
selector's strategy must optimize the winning chances against each possible 
value of n. 

Suppose therefore that k < N and assume that fk{i) > 0. Then, having 
fixed the devil's strategy and optimizing for the selector, accepting the k:th 
labeled card at position i is either better than rejecting, or indifferent. Since 
k < N, there is a positive probability that card i + 1 is labeled. If it is, then 
the selector will have positive probability of winning if card i is rejected, but 
obviously zero probability of winning if card i is accepted. Conditioning on 
card i + 1 being labeled, the selector is strictly better off rejecting card i 
than accepting. Therefore conditioning on card i + 1 not being labeled, the 
selector is strictly better off accepting card i than rejecting it. Now observe 
that the situation after drawing the k:th labeled card as card i, conditioning 
on card i + 1 not being labeled, is the same as if drawing the k:th labeled 
card as card i + 1. 

This shows that the selector must strictly prefer accepting the k:th card 
in position i + 1 over rejecting it, which implies that fk{i + !) = !. □ 

3.2 Reducing the original game to the card game 

We wish to draw conclusions about the original game, which in the following 
we call the last- arrival game, from the analysis of the card game. Starting 
from the last-arrival game, the following is a sequence of games that become 
more and more advantageous to the selector: 

(1) We choose a number and restrict the devil's choices to 1 < n < A^. 

(2) We choose a number d and divide the unit interval into d equal time- 
slots. We allow the selector to wait until the end of the current time-slot 
before deciding whether to accept a newly arrived item. 
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(3) Moreover, we decide that the selector wins by default if any two items 
arrive in the same time-slot (even if the selector had previously accepted 
an item). 

(4) Instead, we first generate independent arrival times ti, . . . ,t]\[ and 
declare the selector winner if any two of those fall in the same time-slot. 
If not, then after the devil chooses a number n, a random subset of n 
of the times ti, . . . , t^v are chosen as arrival times of the n items. 

(5) Before the game, a biased coin is flipped, and with probability {^)/d, 
the selector is declared winner. If not, then the (A^, (i)-card game is 
played. 

Notice that (^) /d is an over-estimate of the probability that two of the 
numbers ti, . . . ,t]\f fall in the same time-slot. Moreover, conditioning on no 
such collision will reduce the game (4) to the card game. 

3.3 The restricted game 

The game in (1) above, which is continuous time but with the devil's choices 
restricted to n < N will be called the N -restricted game. By a threshold 
strategy for the selector, we mean a strategy given by a sequence ai, . . . ,aN, 
where the k:th item is accepted if and only if it arrives after time at- 

Proposition 3.4. For every N, there is a threshold strategy which is optimal 
for the selector in the N -restricted game. 

Proof. We think of as fixed throughout the argument. Lemma [3.31 shows 
that for each d, the (A^, d)-caid game can be played optimally by what is 
essentially a threshold strategy in the discrete setting, the only small crux 
being that randomization may be needed if the k:th labeled card arrives 
exactly at the threshold position. 

Let ak{d) = i/d, where i is the smallest value for which fk{i) > in 
an optimal strategy for the (A^, (i)-card game as in Lemma 13.31 By playing 
the restricted game using the threshold strategy given by ai{d), . . . ,aN{d) 
for large d, the selector will be able to achieve a winning probability in the 
restricted game approaching that of the game (5) in the list. 

If we regard a{d) = {ai{d), . . . ,aN{d)) as a point in the A^- dimensional 
unit cube, then as ci — cxd, the points a{d) must have an aggregation point. 
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and by continuity of the selector's worst case expected payoff under a thresh- 
old strategy as a function of the thresholds, the aggregation point must cor- 
respond to an optimal strategy for the selector in the A'"-restricted game. □ 

Therefore, to obtain an upper bound on the selector's winning probability 
in the last-arrival game, it suffices to establish an upper bound for threshold 
strategies in the A^-restricted game for some A^. 

3.4 Canonical thresholds 

Suppose that for a fixed and some number < < 1, we try to con- 
struct a threshold strategy that achieves a probability of success of at least 
6 for every n < N. Then there is a canonical way of recursively computing 
thresholds Oi, . . . , ojv such that if any sequence of thresholds achieves winning 
probability at least 9, then oi, . . . , ojv does. 

Suppose we have fixed ai, . . . , flfc-i- Then if there is a value for that 
gives probability at leat 9 of success for n = k, then there is a maximal such 
value of ttk, and for that value of Uk, the probability of success is exactly 9. 

The choice of obviously does not affect the probabihty of success for 
n < k. For n > k, the probability of success will increase with larger a^, 
since a larger decreases the risk of accepting an item too early. Therefore 
after fixing oi, . . . ,ak-i, we might as well set to the unique value that 
gives success probability exactly 9. If we do this consistently, starting with 
Oi = 1—9, then we will either get stuck at some point because not even Uk — 
gives winning probability ^ for n = A;, or we find a sequence of thresholds 
that works. 

For each k, the threshold (to the extent it can be defined) is a decreas- 
ing function of 9, but does not depend on N. For each N there is a maximal 
achievable success probability ^jv, characterized by ajv = 0. The sequence 
^AT is decreasing with A^, and therefore has a limit 9opt as N ^ oo. 

We can now establish the existence of an optimal strategy for the selector 
in the last- arrival game. 

Theorem 3.5. There is a threshold strategy which is optimal for the selector 
in the last-arrival game. 

Proof. For each 9 > 9 opt there is some A^ for which 9n < 9, which means 
that success probability 9 is impossible even in some restricted games. On 
the other hand, for 9 — 9opt, the sequence of thresholds can be computed 
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indefinitely without getting stuck, and therefore success probabihty 6opt is 
achievable with a threshold strategy in the last-arrival (unrestricted) game. 

□ 

3.5 Optimal strategy for the devil 

The fact that the devil has an optimal strategy does not seem to follow from 
any "soft" argument. Conceivably the selector's task could become more 
difficult the larger the number of items. But we already have indications 
that such a situation would entail 9opt = 1/e. We will make this argument 
precise, and therefore the first step towards proving the existence of an op- 
timal strategy for the devil is to show that contrary to the "conclusion" of 
the introduction, 6opt is strictly smaller than 1/e. An independent proof of 
this based on numerical calculation is given in Section 14.21 

Proposition 3.6. 

9opt < 1/e. 

Proof. Clearly it suffices to consider the selector's optimal strategy. Let 
be the thresholds for this strategy. First observe that — )■ 1 as A; — )■ cxo. 
The reason is that if is too small, the selector will accept too early when 
n = k + 1. It follows that there are infinitely many records in the sequence, 
by which we mean such that ak > ai for every i < k. 

If is a record, then in order for the selector to succeed when n = k, there 
must be exactly one item arriving after time a^. This shows that 6opt < 1/e, 
and if 9opt = 1/e, we must have ak = 1 — 1/A; + o{l/k) whenever ak is a 
record. Now let and a„ be records for large m, n such that n > 2m. Since 
ttm = 1 — l/m + o(l/m) and a„ = 1 — l/n + o{l/n), there must be a record ak 
for some k in the interval m < k < n such that ak — o-fc-i > 1/A;^ — o{l/k'^). 

Now consider the winning probability for n = k. The probability of 
exactly one item arriving in the interval [ak, 1] is maximized when ak = 1—1/ k 
and therefore cannot be greater than 

(l - If ^ exp [ik - 1, log (l - i)) ^ e- (l + 4) H- O (1) , 

as — 7- oo, by the Taylor expansion of exp((l/a;— 1) log(l— x)) where x = 1/k. 

Now condition on exactly one item arriving after time a^. Then the 
strategy will still fail if there is an item arriving in the interval [ak-i,ak]. 
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The probability of this is at least 1/k — o{l/k). Hence the selector's winning 
probability for n = k is at most 



-1 
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^^2k~k 
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□ 



Theorem 3.7. The devil has an optimal strategy for the last- arrival game. 

Proof. For every N, it follows by compactness of the set of strategies that 
there is an optimal strategy for the devil in the A^-restricted game. The 
optimal strategy is simply a probability distribution on {1,...,A^}. It is 
easy to verify that if a sequence of optimal strategies for the iV-restricted 
games have an aggregation point in the total variation metric, then that 
aggregation point must represent a strategy for the last-arrival game which 
keeps the selector's success probability to 9opt, and which is therefore optimal 
for the devil. 

In order to prove the existence of such an aggregation point, it suffices, in 
view of the principle of dominated convergence, to prove that for every e > 
there is a C such that no optimal strategy of any A^-restricted game assigns 
probability more than e to the set of numbers {n : n > C}. 

Obviously it suffices to find a C such that only finitely many A^-restricted 
games have optimal strategies that give probability more than e to {n : n > 
C'}, since we can then choose C beyond the largest of those values of N. 

Therefore assume for a contradiction that there is some e > such that 
for every C there are arbitrarily large for which some optimal strategy in 
the A^- restricted game gives probability e or more to {n > C}. This means 
that, eliminating A^ from the argument and considering the last-arrival game, 
for every C and every 9 > 9 opt, the devil can find a strategy that assigns 
probability at least e to {n > C} and keeps the selector's success probability 
to at most 9. 

We choose 

9 = 9opt + 2 ' (^e ^ ^opt^ ■ 

By Proposition 13. 6[ 9 > 9opt- Now for every mo and every 5 > 0, we can find 
an m > mg and a strategy for the devil that keeps the selector's winning 
probability below 9, and which assigns weight at most 5 to {m + 1, . . . , 2m}, 
and at least e to {n : n> 2m}. Indeed, if we choose C > tuq ■ 2^/^, then not 
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all of the intervals mo2* <n < m^T^^^ for i < 1/5 can get weight more than 



The idea is now to construct a strategy for the selector that "sacrifices" 
the values of n in the interval m < n < 2m, and in return obtains success 
probability close to 1/e for all n > 2m. This is done by first observing the 
number of items in the interval [0, 1/2] (here the number 1/2 is an arbitrary 
number smaller than every threshold of the selector's optimal strategy). Our 
strategy will never guess that n is in the interval m < n < 2m, and the first 
step is to decide, based on the number of items arriving in [0, 1/2], between 
the two options n < m and n > 2m. Provided S is small enough and mo 
(and thereby m) is large enough, this decision can be made with an error 
probability which is as small as we please. 

If we find that n < m, then we play normally, which means that con- 
ditioning on 77, < m we achieve a winning probability as close as we please 
to 6opt- If on the other hand we find that n > 2m, then we switch to the 
strategy used in the proof of Proposition 12.11 namely to accept the first item 
arriving after time 1 — 1/(2 A;), where k is the number of items observed in 
[0, 1/2]. This gives the selector a conditional probability of success close to 
1/e > ^ in case n actually is larger than 2m. 

In all, this means that we can construct a strategy for the selector which 
achieves success probability as close as we please to 



4 Computing 9opt 

In this section we show how we have computed numerically the probability 
6opt of success for the selector with the optimal strategy, while providing the 
Maple code that was used. We obtain the bounds 



4.1 The P-polynomials 

We already described the idea of prescribing a value for 6, which recursively 
defines thresholds that achieve success probability 6 for the selector, if pos- 



6. 




opt > 



a contradiction. 



□ 



0.3529170002071955 < Oor^t < 0.3529170002071958. 
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sible. Here we show how to actually compute these thresholds numerically. 
The calculations turn out to involve certain polynomials which we describe 
first. 

Let [n] = {l,...,n}. We define the polynomial Pn in the variables 
xi,...,Xn by 



where the sum is taken over all / : [n] — > [n] such that for 1 < i < n, 



In other words, a monomial is present in the sum if and only if xi occurs at 
least once, Xi and X2 together occur at least twice etc. 

We have Pq = 1, Pi = Xi, P2 = xl + 2xiX2, P^ = x^ + 3x1x2 + Sxfx^ + 
3x1X2 + 6x1X2X3, etc. In general, the number of terms is equal to the n:th 
Catalan number, and the sum of the coefficients is equal to + These 
polynomials are related to parking functions, trees etc, for which there is an 
extensive combinatorial theory. 

The polynomials P„ are calculated forn < (here we have taken N = 20) 
using the Maple code: 

N:=20; 
P[0] :=1: 
for n to N do 
P[n]: = 

add (binomial (n, n-i)*x[l] "(n-i)*eval(P[i] , 

{x[l]=add(x[k] , k=2..n-i+l), seq(x[j]=x[j+n-i] , j=2..i)}), 

i=0. .n-1) : 



We must avoid using the command simplify on P„, since that will make 
each polynomial roughly four times the size (in terms of memory) of the 
previous one, instead of twice. 

If Xi are the lengths of disjoint subintervals of the unit interval, then 
Pn is the probability that of n independent uniformly chosen points, at least 
one lies in Ji, at least two in Ii U I2, at least three in Ji U /2 U etc. 

We can see how this relates to the last-arrival problem if we assume that 
we have fixed the thresholds Oi, . . . , a„ for a strategy, and we ask for the 




/ 



od: 
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probability of not accepting any item if there are n items. If we have 



ai < a2 < ■ ■ ■ < tt; 



then not accepting any item is equivalent to having at least one item appear- 
ing before time oi, at least two before 02 etc, which has probability 



Then ai^n < Ci2,n < ■ ■ • < «n,n, and not accepting any of n items is equivalent 
to having at least i items appear before time at^n for every i, and therefore 
has probability 



Suppose we fix a number 6. Then we can recursively obtain upper bounds 
bi on the thresholds for a strategy that achieves success probability at least 
6. First we let bi = 1 — 6. This is easily seen to be an upper bound on oi 
by choosing n = 1. Next we consider n = 2 and derive an upper bound on 
02- We label a particular item and estimate the probability of success for the 
selector by n times the probability that the labeled item is the last one to 
arrive and is accepted (and no earlier item is accepted). 

For n = 2 we obtain 



Success (with the labeled item) requires that (a) the labeled item arrives 
after time 02 and (b) that the other item arrives before time oi. Notice that 
we do not necessarily have equality: If 02 < ai then the labeled item need 
not be accepted even if (a) and (b) both hold. From ([2]) it follows that 



P„(ai, a2 — cii, ■ ■ ■ ,an — On-i)- 



To cover also the case that the Oj's are not increasing, we let 



ai^n = mm aj. 





(2) 



02 < 1 




2ai 



If now P(success) > 9 and ai < bi it follows that 
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so we can take b2 — 1 — 9/{2bi). In general we have 

P(success) < n-(l-a„)-P„_i(ai,„_i,Q;2,n-i-"i,n-i, • • • , an-l,n-l-an-2,n-l)- 

Suppose now that we have estabhshed upper bounds 6j on Oj, and from 
them similarly define 



/3i,n = min bj. 

i<j<n 



Then 



9 < P(succcss) 

< n(l — a„) ■ Pn-l{oii^n-l, «2,n-l ~ «l,n-l) ■ ■ ■ ) 0;„-l,n-l ~ CKn-2,n-l) 

< n(l — a„) • Pn-l(/5l,n-l) ^2,71-1 ~ Pl,n-1, ■ ■ ■ , Pn-l,n-l ~ Pn-2,n-l), (3) 

and therefore 

9 

a„ < 1 z r. 

n- ■ Pn-\{^l,n-l, ^2,n-l — /^l.n-l, ■ ■ ■ , Pn-l,n-l — ^n-2,n-l) 

If we recursively define 

Q 

n ■ Pn-l{f3l,n-l, ^2,11-1 — f3l,n-l, ■ ■ ■ , Pn-l,n-l — Pn-2,n-l) ' 

then for every strategy that achieves success probability 9 we must have 
«fc < bk for every k. 

When we actually compute the bounds bk, it is easiest to work with only 
one sequence of /3's that are continuously updated. Each round of computing 
a new 6„ is as follows: 



n ■ Pn-l{(3l, /32- /3i,..., /3n-l - /3n-2) 

For 1 < i < n — 1, 

bn ■= Pn- 

The Maple code is (here with the choice 9 = 1/e): 
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theta: =evalf (exp(-l) ) ; 
beta[0] :=0: 
for n to N+1 do 

beta[n] : =l-theta/n/eval (P [n-1] , 

{seq(x[i]=beta[i]-beta[i-l] , i=l. .n-1)}) : 

b[n] :=beta[n] : 

print (n, b [n] ) : 

for i to n-1 do 

if beta[i]>b[n] then beta [i] :=b[n] : fi: 

od: 
od: 

4.2 Upper bound smaller than 1/e 

If we start from the ansatz 6* = 1/e, we get the upper bounds 



bi = 


0.632120558828558 


b2 = 


0.709011646565337 


h = 


0.753159994034917 


64 = 


0.781938205897688 


h = 


0.801087317290966 




0.812758005512094 


h = 


0.816930118314804 


bs = 


0.810454956408292 


b,= 


0.780396988248096 


bio = 


0.657338420431837 


bn = 


-1.22035626433979 



Here bn is negative, and since no strategy can have an < 0, the conclusion 
is that no strategy for the selector can achieve winning probability 1/e for 
all n. In fact the conclusion is slightly stronger: Even in the restricted game 
for = 11, there is no way the selector can win with probability at least 
1/e. Therefore we have an alternative proof of Proposition 13.61 

As long as the ftj's are increasing, we actually obtain a strategy that wins 
with the desired probability, so we also see that if the devil is restricted to 
1 < 77, < 7, the selector can in fact obtain winning probability 1/e. 
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A little experimentation shows that for 9 = 0.35, the selector doesn't 
seem to be in trouble, while the slightly larger 9 = 0.355 leads to 620 < 0. 
It also seems that as soon as the bi 's start decreasing, they enter a losing 
spiral that quickly leads to a negative value. 

4.3 Refinement 

To obtain a more precise upper bound on 9opt, we would like to extend the 
computation of the fei's to i > 20. With the method described so far, the 
problem is that the polynomials P„ grow exponentially in terms of computer 
memory, preventing us to go much further than to n = 25. Therefore we will 
use an estimate based on the polynomials Pi, ... , P/v for a fixed A^, where in 
practice we have taken = 24. We write for ai^n-i to make the equation 
readable: 

P(success) < n{l — an) 

(4) 

The explanation is straightforward: Suppose that one of the n items 
is labeled. The probability of success is n times the probability that the 
labeled item is last and is accepted. Success with the labeled item requires 
that item to appear in the interval [cin,l]- Moreover, for some i with < 
i < N, we must have exactly i items appearing after time a„_Ar_i_„_i. Of 
those, at least one must appear in [a„_7v-i,n-i5 «n-i,n-i], at least two in 

[oin-N-l,n-l, 0!n-i+l,n-l\ etC. 

We can use (jl]) to compute upper bounds bn in the same way as before. 
The Maple code is: 

for n from N+2 to 1000 do 

beta[n] :=1 - theta/n/add(binomial (n-1 , i) *beta[n-N-l] " (n-l-i) 

*eval(P[i], {x[l] = beta[n-i] - beta[n-N-l] , 

seq(x[j] = beta[n-i-H-j] - beta[n-i-2+j] , j=2..i)}), 1=0. .N): 

b[n] :=beta[n] : 

print (n, b [n] ) : 

for i to n-1 do 
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if beta[i]>b[n] then beta [i] :=b[n] : fi: 
od: 
od: 

With = 24 and = 0.3529170002071958 this gives 6147 < 0. Therefore 
Oopt < 0.3529170002071958. (5) 

4.4 Lower bound 

So far we have not estabhshed any exphcit lower bound on the selector's prob- 
ability of success, although it seems that our upper bound is quite sharp. In 
this section we construct a threshold strategy for the selector that guarantees 
winning probability close to the upper bound ([5]). We restrict our attention 
to strategies for which the sequence of thresholds is nondecreasing. Again 
we start from an ansatz for 9. For n < + 1, we compute the sequence by 

6 

«n = 1 5 r- (6) 

To extend the computations we now need a lower bound on the probability 
of success for the selector. Since we will require the sequence of a^'s to be 
increasing, we do not have to bother with introducing the a's and /5's. 

Proposition 4.1. 

P(success) , 
n 

71 — 1 — i J ~i ( \ 

Proof. Success with a labeled item is equivalent to the following conditions 
being satisfied: 

1. The labeled item appears in [a„, 1]. 

2. For some i with < i < A^, there appears exactly i items in the interval 
[a„_iv-i, and the remaining n — 1 — i appear before an-N-i- 
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3. Of the i items in the interval [an-N-i, c^n-i], at least one arrives in the 
interval [a„_iv-i, ctn-i], at least two in the interval [a„_7v-i, etc. 

4. For the n — 1 — i items that arrive before an-N-i, there is no subset 
of k items for N + 2 — i<k<n — 1 — i that arrive in the interval 

Condition 1 corresponds to the factor (1 — a„), condition 2 to the sum where 
i goes from to N, condition 3 to the occurrence of Pi, and condition 4 to 
the last factor in the summand. That last factor is where in general we do 
not have equality. □ 

For "intermediate" n, we use the recursion 



nElo iVK-N-l^ (l - E^.-. m (l - ■ Pi 

where Pi is evaluated with the arguments as in ([7]). The Maple code is 

for n from N+2 to 1000 do 

a[n] :=1 - theta/n/add(binomial(n-l , i)*a[n-N-l] ~(n-l-i)* 
(1 - add(binoinial(n-l-i,k)*(l-a[n-i-k]/a[n-N-l])~k, 
k=N+2-i. .n-l-i))* 

eval(P[i], {x[l] = a[n-i] - a[n-N-l] , 

seq(x[j] = a[n-i-l+j] - a[n-i-2+j] , j=2..i)}), 1=0.. N): 
print (n, a[n] ) : 

if a[n]<a[n-l] then print ("Darn! ") : fi: 
od: 

With N = 24, this seems to work for 6 = 0.3529170002071955. Next we 
turn to proving rigorously that the strategy works for all n. 

4.5 Construction of a strategy 

We describe a scheme for proving that a certain sequence gives winning 
probability at least 6. Our sequences will be of the following form: For some 
m, the numbers Oi, . . . , — 1 are given by a specific list, and for k > m, 

1 

flfc = 1 - T- 
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We have taken 6 = 0.3529170002071955. If we compute a„ according to 
the previous section, we notice an interesting phenomenon. The quantity 
ri(l — a„) is at first increasing, and seems to stabihze at around 1.29. Then for 
n at around 130 to 160, it suddenly drops to around 0.74 where it stabihzes. 

This phenomenon can be understood at least intuitively. For large n, 
the most important factor for success is that exactly one item must arrive 
after a„. If the probability of success is exactly 6, this confines a„ to a short 
interval around one of the points 1 — xi/n and 1 — X2/n, where xi and X2 
are the two solutions to xe~^ = 6, unless some other reason for failure has 
unusually large probability. The second most important reason for failure is 
if an item arrives between a„_i and a„. This means that the sequence a„ can 
take large steps if it is on the hill between 1 — xi/n and l — X2/n. The strategy 
is in danger of failure as long as a„ ~ 1 — 1.29/n, but if it manages to climb 
the hill at 1 — 1/n it will reach security and stabilize at a„ ~ 1 — 0.74/?7,. 

The first time > 1 — 1/n is at n = 147 and we therefore take m = 147. 
In other words, we construct a sequence a„ by ([6]) for n < 25, by ([8]) for 
26 < n < 146, and by a„ = 1 — l/n for n > 147. In computer language we 
overwrite the old values of a„ for n > 147: 

for n from 147 to 1000 do 
a[n] :=l-l/n: 



We have used ([7]) to verify that with this sequence, the probability of 
success is at least 6^ for n < 750. 

4.6 Proving a lower bound on 9 opt 

Our next task is to prove that the constructed strategy wins with probability 
at least 6 also for large n. Success requires that exactly one item arrives after 
time a„. We start by estimating the probability of this. 

Lemma 4.2. The probability of exactly one item appearing after time an is 
at least 1/e. 

Proof. We estimate it by 



od: 




□ 
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In the following we condition on exactly one item after time a„. With a 
rough estimate, the (conditional) probability that there are at least k (out 
of n — 1) items arriving in the interval [a„_fc, a„] is at most 



n-l\ f ^ an-k^ ^ 



k J \ an 

Conditioning on exactly one item arriving after time a„, the failure proba- 
bility is therefore bounded by 



k=l 



It remains to bound (Q, which naturally we do in two steps. We begin 
with the terms for which a„_fc is given hy 1 — l/{n — k). Recall that this is 
when n — k > m and that in the specific case we have in mind, m = 147. 



Lemma 4.3. For n > m, we have 



k=l 

Proof. For n — k > m we have 



k I \ an I n — 2 



1 ]_ 

^ _ 0"n-k _ ^ n~k 



an 1-^ {n-k){n-l)' 

By an elementary integral estimate, 

k^ 



k\ 



for every k. Therefore 



n~m / 1 \ / \ k n-m , s k 



Differentiating the expression 



k 

e 



n — k 
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(12) 



twice with respect to k gives 



' ^ ^ ^ k V f e V f 2 k 



log — ^ +— ^ + — ^ — ^ + 



n — k J \ \n — k J n — k J \n — k J \n — k {n — k) 

(13) 

which is obviously nonnegative. Hence f|T2|) is convex regarded as a function 
of k. Therefore whenever 

> (-) ' (14) 
that is, when the first term is at least as large as the last term, we have 

El ^ , ) < \- (n-m-1 
\ n — h n — 1 



-1 



fc=l 

Since 



n — k J n — 1 (n — 2)' 



decreases whenever 

n-2> 



it suffices to find a single value of n for which f|T^ is valid, and obviously 
equality holds when n = m + 2. 
Consequently 



k=l 

which in turn is 



n—m \ k -, / -,\ 

72—1^ an-k\ ^ 1 e-\n — m — l) 



k J \ CLn J n — 1 (n — 2) 



1 e-(n-2) e + 1 

< ^ + - 



2 



n-2 (n-2)2 n - 2' 

It is easy to check that this holds also when n = m + 1. The bound (fTOj) now 
follows. □ 

The part of the sum in IQ for which k > n — m is estimated by (assuming 
that n > 2m — 3) 



fc=n— m+l N / ^ / i^~fi—Yn+l ^ 



k 



< [ni - I) . j^^—-- {I - a,)"+\ (15) 



(m-2)! 
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which is decreasing for n > 2m, since 

m — 2 



n > 



log(l - ai)' 



In our case it is smaller than 10 

Putting together these pieces, we obtain 

P(success) > ■ (1 - - 10"^°° 

\ n — 2 

which clearly is larger than our choice of 6 whenever n > 750 (actually much 
earlier). We conclude that, with reservation for purely numerical errors, 

0.3529170002071955 < 9opt < 0.3529170002071958. 



5 The secretary problem for a partially or- 
dered set 

The original motivation for considering the last-arrival problem was the secre- 
tary problem for a partially ordered set. The items (secretaries) are partially 
ordered and presented in random order to a selector, whose objective is to 
choose online a maximal element of the partial order. J. Preater [B] showed 
that there is a strategy that achieves this with probability 1/8, and this was 
improved to 1/4 by N. Georgiou, M. Kuchta, M. Morayne and J. Niemiec |1] 
and further to 1/4 + e by J. Kozik [5]. Recently R. Freij and the author |3] 
found a strategy that achieves success probability 1/e, thereby matching the 
upper bound from the classical secretary problem. 

Before finding the solution in [3j , the analysis of several reasonable strate- 
gies showed that with decent probability they tend to accept the last maximal 
element to arrive. An idea was to consider a maximal element x of the partial 
order, and to condition on all other maximal element arriving before x. As is 
shown below, this type of argument is strong enough to establish a nonzero 
probability of success in the partially ordered secretary problem. 

Naturally this led to the conjecture that the correct generalization of the 
strategy for the classical secretary problem would turn out to accept the last 
maximal element of the partial order with probability at least 1/e. For this 
conjecture it was natural to first consider a completely "unordered" set, and 



22 



this is precisely the last-arrival problem. After some confusion, as described 
in the introduction, the conjecture turned out to be false, and therefore an 
analysis focusing only on the last arriving maximal element cannot establish 
winning probability 1/e for the partially ordered secretary problem. 

Nevertheless, it turns out that there is a strategy that works for an arbi- 
trary partial order, and with probability uniformly bounded away from zero 
picks the last maximal element. The strategy builds on a strategy for the 
last-arrival problem, and uses the following result. 

Lemma 5.1. There is a threshold strategy for the last- arrival problem which 
accepts the last item with probability at least 9i > 0, and with probability at 
least 62 > does not accept any item. 

We have proved the first part of the statement in detail, and it is clear that 
the optimal strategy must also satisfy the second part (so does the strategy 
described in Proposition 12. but it is not a threshold strategy). 

We now apply the following strategy for the partially ordered secretary 
problem: Accept an element if (1) it is maximal in the induced partial order 
of elements seen so far, and (2) arrives at a time later than a^, where k is 
the number of maximal elements in that induced partial order. 

Proposition 5.2. This strategy for the partially ordered secretary problem 
that with probability at least 6162 > picks the last maximal element. 

Proof. We condition on the number m of maximal elements in the partial 
order, and the number q of elements that are dominated only by the last 
maximal element x to arrive. We condition on x arriving in the time interval 
[am, 1]. The probability for this is at least 61. 

With probability at least 62, the m + q — l maximal elements in the partial 
order P — x will arrive early enough to guarantee that none of them, and 
thereby no element at all, is accepted before x arrives (we have conditioned 
on the m — 1 remaining maximal elements arriving before x arrives, but this 
only increases the probability of success). □ 
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